Name | Version | Summary | date |
---|---|---|---|
shtec-rlhf | 0.0.3.dev0 | shtec-rlhf: Safe Reinforcement Learning from Human Feedback | 2024-05-20 15:34:00 |
trl | 0.8.4 | Train transformer language models with reinforcement learning. | 2024-04-17 15:16:50 |
hour | day | week | total |
---|---|---|---|
44 | 1377 | 9840 | 213015 |